Augmented-SVM for gradient observations with application to learning multiple-attractor dynamics
Authors
Ashwini Shukla and Aude Billard, École Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland 1015
Abstract
In this chapter we present a new formulation that builds on the principle of the Support Vector Machine (SVM). This formulation, Augmented-SVM (A-SVM), combines gradient observations with the standard observations of function values (integer labels in classification problems and real values in regression) within a single SVM-like optimization framework. It extends the existing SVM by enforcing constraints on the gradient of the classifier/regression function. The new constraints modify the original SVM dual, whose optimal solution then yields a new class of support vectors (SV). We present our approach in the light of a particular application in robotics, namely, learning a non-linear dynamical system (DS) with multiple attractors. Non-linear DS have been used extensively for encoding robot motions with a single attractor placed at a predefined target where the motion is required to terminate. In this chapter, instead of insisting on a single attractor, we focus on combining several such DS with distinct attractors, resulting in a multi-stable DS. While exploiting multiple attractors provides more flexibility in recovering from unseen perturbations, it also increases the complexity of the underlying learning problem. We address this problem by augmenting the standard SVM formulation with gradient-based constraints derived from the individual DS. The new SV corresponding to the gradient constraints ensure that the resulting multi-stable DS incurs minimum deviation from the original dynamics and is stable at each of the attractors within a finite region of attraction. We show, via implementations on a simulated 10-degree-of-freedom mobile robotic platform, that the model ...
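The idea of constraining the gradient of an SVM-like function can be made concrete with a small sketch. The code below is not the A-SVM formulation itself. It only assumes a generic kernel expansion f(x) = sum_i alpha_i k(x, x_i) + b with an RBF kernel, and shows how the gradient of such a function is computed and compared against an observed velocity. All names (`decision_value`, `decision_gradient`, `gradient_alignment`), the kernel choice, and the sign convention of the alignment are illustrative assumptions rather than details taken from the chapter.

```python
import numpy as np


def rbf_kernel(x, xi, sigma):
    """RBF kernel k(x, xi) = exp(-||x - xi||^2 / (2 sigma^2))."""
    d = x - xi
    return np.exp(-np.dot(d, d) / (2.0 * sigma ** 2))


def decision_value(x, support, alpha, b, sigma):
    """Generic kernel expansion f(x) = sum_i alpha_i k(x, x_i) + b (assumed form)."""
    return sum(a * rbf_kernel(x, xi, sigma) for a, xi in zip(alpha, support)) + b


def decision_gradient(x, support, alpha, sigma):
    """Analytic gradient of f with respect to x for the RBF kernel.

    Uses d/dx k(x, xi) = -(x - xi) / sigma^2 * k(x, xi).
    """
    grad = np.zeros(x.shape, dtype=float)
    for a, xi in zip(alpha, support):
        grad += a * rbf_kernel(x, xi, sigma) * (-(x - xi) / sigma ** 2)
    return grad


def gradient_alignment(x, velocity, support, alpha, sigma):
    """Inner product between grad f(x) and an observed velocity at x.

    A gradient-type constraint could require this quantity to have a
    prescribed sign; which sign is an assumption of this sketch.
    """
    return float(np.dot(decision_gradient(x, support, alpha, sigma), velocity))
```

In a learning setting, quantities such as `gradient_alignment` would enter the optimization as additional constraints next to the usual margin constraints on `decision_value`; here the coefficients are simply taken as given.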
Similar resources
Augmented-SVM: Automatic space partitioning for combining multiple non-linear dynamics
Non-linear dynamical systems (DS) have been used extensively for building generative models of human behavior. Their applications range from modeling brain dynamics to encoding motor commands. Many schemes have been proposed for encoding robot motions using dynamical systems with a single attractor placed at a predefined target in state space. Although these enable the robots to react against s...
Girsanov Based Direct Policy Gradient Methods
Despite the plethora of reinforcement learning algorithms in machine learning and control, the majority of the work in this area relies on discrete time formulations of stochastic dynamics. In this work we present a new policy gradient algorithm for reinforcement learning in continuous state action spaces and continuous time. The derivation is based on successive application of Girsanov’s theor...
Building Recurrent Neural Networks to Implement Multiple Attractor Dynamics Using the Gradient Descent Method
The present paper proposes a recurrent neural network model and learning algorithm that can acquire the ability to generate desired multiple sequences. The network model is a dynamical system in which the transition function is a contraction mapping, and the learning algorithm is based on the gradient descent method. We show a numerical simulation in which a recurrent neural network obtains a m...
APPLICATION OF THE HYBRID HARMONY SEARCH WITH SUPPORT VECTOR MACHINE FOR IDENTIFICATION AND CLASSIFICATION OF DAMAGED ZONE AROUND UNDERGROUND SPACES
An excavation damage zone (EDZ) can be defined as a rock zone where the rock properties and conditions have been changed by processes related to an excavation. This zone affects the behavior of the rock mass surrounding the construction, reducing the stability and safety factor and increasing the probability of failure of the structure. This paper presents an approach to build a model for the ...
A New Formulation for Cost-Sensitive Two Group Support Vector Machine with Multiple Error Rate
Support vector machine (SVM) is a popular classification technique that classifies data using a max-margin separating hyperplane. The normal vector and bias of this hyperplane are determined by solving a quadratic model, which means that SVM training amounts to an optimization problem. Among the extensions of SVM, the cost-sensitive scheme refers to a model with multiple costs which conside...
Publication date: 2013